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TITLE: METHODS OF DETECTING INTERACTIONS BETWEEN 
PROTEINS, PEPTIDES OR LIBRARIES THEREOF USING 
FUSION PROTEINS 

5 INVENTORS: Andrew D. Hamilton, Indraneel Ghosh, and Lynne Regan 

RELATED APPLICATIONS 

This application claims the benefit of priority of U.S. Provisional Application 
60/203,712, filed on May 12, 2000. 

10 

FIELD OF THE INVENTION 

The present invention is related to the reassembly of fiision peptides into a 
functionally active protein complex. Specifically, the i>resent invention provides a 
method of forming peptide complexes that associate through the combination of helical 
15 • domains to form an antiparallel leucine zipper. The present invention is also related to 
the use of assays to investigate protein-protein interactions. The assays of the present 
invention involve the association of fiision proteins comprising GFP firagments and 
heterologous polypeptides into fimctionally active GFP that exhibits fluorescence. 

BACKGROUND OF THE INVENTION 

All publications and patent applications herein are incorporated by reference to 
the same extent as if each individual publication or patent ^plication was specifically 
and individually indicated to be incorporated by reference; 

Green Fhmrescent Protem 

Green fluorescent protein (GFP), a relatively small protein comprising 238 amino 
acids, is the ultimate source of fluorescent light emission in the jellyfish ^egworea 
victoria. The gene for GFP was first cloned by Prasher et al. (1992, Gene, 1 1 1 :229-233), 
and cDNA for the protein produces a fluorescent product identical to that of native 
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protein when expressed in prokaryotic (E. coU) and eucaiyotic (C. elegans) ceUs (Chalfie 
a/., 1994, Science, 263, 802-805). 

The GFP excitation spectrum shows an absorption band (blue light) maximally at 
395 nm with a minor peak at 470 nm, and an emission peak (green light) at 509 nm. The 
5 longer-wavelength excitation peak has greater photostability then the shorter peak, but is 
relatively low in amplitude (Chalfie et al, 1994, Science, 263:802-805). The crystal 
structure of the protein and of several point mutants has been solved (Ormo et al.. 1996, 
Science 273, 1392; Yang et al.. Nature Biotechnol. 14, 1246). The fluorophore, 
consisting of a tripeptide at residues 65-67, is buried inside a relatively rigid beta-can 
10 structure, where it is ahnost completely protected from solvent access. The GFP 
absorption bands and emission peak arise from an internal p-hydroxybenzylidene- 
imidazoHdinone chromophore, which is generated by cyclization and oxidation of the 
tripeptide sequence Ser-Tyr-Gly sequence at residues 65-67 (Ck)dy et al., 1993, 
Biochemistry 32:1212-1218). 

GFP fluorescence in procaryotic and eucaryotic cells does not require exogenous 
substrates and cofactors. Accordingly. GFP is considered to have tremendous potential in 
methods to monitor gene expression, ceU development, or as an in situ tag for fusion 
proteins (Heim et al, 1994. P.N.A.S. USA, 91.12501-12504). Chalfie and Prasher, WO 
95/07463 (March 16, 1995), describe various uses of GFP, including a method of 
examining gene expression and protein localization in Uving cells. Methods are described 
wherein: 1) a DNA molecule is introduced into a cell, said DNA molecule having DNA 
sequence of a particular gene linked to DNA sequence encoding GFP such that the 
regulatory element of the gene will control expression of GFP; 2) the ceU is cultured in 
conditions permitting the expression of the fused protdn; and 3) detection of expression 
25 of GFP in the cell, thereby indicating the expression of the gene in the cell. Methods such 
as those described by Chalfie and Prasher are advantageous compared to previously 
reported methods which utiUzed p-galactosidase fiision prbteins (Silhavy and Beckwith, 
1985. Microbiol. Rev., 49, 398; Gould and Subramani, 1988. ^w/. Biochem., 175, 5; 
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Stewart and Williams, 1992, J. Gen. MicrobioL 138,1289) or luciferases, in that the need 
to fix cell preparations and/or add exogenous substrates and cofactors is eliminated. 

GFP is a valuable marker for intracellular protein localization. However, the 
fusion of GFP with structural proteins can alter their properties, resulting in loss of fusion 

5 protein localization, decreased GFP fluorescence or both. The fluorescence of this 
protein is sensitive to a number of point mutations (Phillips, G. N., 1997, Curr. Opin. 
Struct Biol 7, 821-27). The fluorescence appears to be a sensitive indication of the 
preservation of the native structure of the protein, since any disruption of the structure 
allowing solvent access to the fluorophoric tripeptide will quench the fluorescence. 

10 Abedi et al (1998, Nucleic Acids Res,, 26, 623-30) have inserted peptides between 

residues contained in several GFP loops. Inserts of the short sequence LEEFGS (SEQ ID 
NO: 9) between adjacent residues at 10 intemal insertion sites were tried. Of these, 
inserts at three sites, between residues 157-158, 172-173 and 194-195 gave fluorescence 
of at least 1% of that of wild type GFP. Only inserts between residues 157-158 and 172- 

15 173 had fluorescence of at least 10% of wild type GFP. 

Protein Reassembly using Leucine Zipper 

The unassisted reconstitution of proteins fix>m peptide fragments has been 
demonstrated for several proteins; including ribonuclease QEUchards et al, 1959, X Biol. 

20 Chem. 234, 1459-1465), chymotrypsin hihibitor-2 (Gay et al, 1994, Biochemistry, 33, 
7957 -7963), tRNA synthetases (Shiba et a/., 1992; Proc Natl Acad. Sci. U.SJL., 89, 
1880-1884), and inteins (Southworth, et al, 1998, EMBO J., 17, 918-926). Protein 
reassembly has thus become an important avenue for imderstanding enzyme catalysis 
(Richards et al, 1959, J. Biol Otem. 234, 1459-1465), protein folding (Gay et al, 1994, 

25 Biochemistry^ 33, 7957 -7963), and protein evolution (Shiba et al^ 1992, Proc. Natl 
Acad. Set U.S.A., 89, 1880-1884). Recently, assisted protein reassembly or "firagment 
complementation" has been applied to the in vivo detection of protein-protein interactions 
in such systems as dihydrofolate reductase (DHFR) (Pelletier et al, 1998, Proc. Natl 
Acad. Set U.S.A., 95, 12141-12146; Remy et al, 1999, Proc. Natl Acad. Set U.S.A., 96, 
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5394-5399; Pelletier et aJ., 1999, Nat Biotedmol, 17, 683-690), ubiquitin (Karimova et 
al, 1998, Proc. Natl Acad. Sci. U.S.A., 95, 5752-5756; Johnsson et al, 1994, Proc. Natl 
Acad. Sci. U.S.A., 91, 10340-10344), and P-galactosidase (Rossi et al, 1997, Proc. Natl 
Acad. Sci. U.S.A., 94, 8405-8410). These reassembly processes are contingent upon the 
5 proper choice of a dissection site within a protein and can be aided by techniques such as 
limited proteolysis,-circular permutation (Bairde/a/., 1999, Proc. Natl Acad. Set U.S.A., 
96, 1 1241-1 1246; Topell et al, 1999, FEBSLett., 457, 283-2S9; Zhang et al, 1993, 
Biochemistry, 32, 12311-12318; Regan, L., 1999, Curr. Opin. Struc. Biol, 9, 494- 
499)and loop insertions (Abedi et al, 1998, Nucleic Acid Res., 26, 623-630; Nobuhide et 

10 al, 1999, FEBSLett., 453, 305-307). 

The dissection and subsequent reassembly of a protein torn peptidic fragments 
provides an avenue for controlling its tertiary structure and hence its function. Although 
a majority of leucine zippers associate in a parallel fashion, recent examples of both 
naturally occurring and designed antiparallel leucine.zippers have appeared in the 

15 Uterature (Lupas, A., 1996, D-ends Biochem. Sc. 21, 375-382; Kohn, W. D. et al, 1997. 
S. J. Biol Chem. 272, 2583-2586; Bryson, J. W. et al. 1995, Science, 270, 935-941 ; 
Oakley M. G. et al, 1998, Biochemistry, 37, 12603-12610; Oakley, M. G. et al., 1997, 
Biochemistry, 36, 2544-2548). However, the prior art does not disclose the attachment of 
antiparaUel leucine zippers to polypeptide fragments to form fiision proteins for 

20 reassembling the polypeptide fragments into functional proteins. 

In contrast to paraUel zippers, the antiparallel zippers are oriented in an opposite 
direction. AntiparaUel Zippers have the advantage of occurring less frequently in natural 
proteins. Thus, antiparaUel leucine zippers wiU interfere to a lesser extent with natural 
ceUular proteins than paraUel leucine zippers. AntiparaUel attachment of leucine zippers 

25 to protein fragments (between a dissected peptide bond of the parent protein) requires a 
shorter amino acid linker region. As shown by the inventors of the present invention, as a 
preferred embodiment, a linker having 4-6 amino acids is sufficient (see Examples). 
Similar attachment of paraUel leucine zippers would require > 10 amino acids to span the 
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necessary distance. The long unstractured linkers would be prone to proteolytic cleavage 
and be less stable in in ^nvo assays. 

Katz et al (1998, BiotechnigueSy 25, 298) describe a targeting approach based on 
noncovalent heterodimerization of GFP and cytoplasmic structural proteins using a 
5 leucine zipper designed to form high-affinity heterodimers. The complexes localized 
accurately to specific sites within cells, providing selective fluorescence labeling of 
subcellular structures such as microfilaments or focal contacts. 

Protein-Prot ein Interaction Assays 

1 0 The association and dissociation of proteins is cracial to all aspects of cell 

Amotion. Examples of protein-protein interactions are evident in horaiones and their 
respective receptors, in intracellular and extracellular signalling events mediated by 
proteins, in enzyme substrate interactions, in intracellular protein trafficking, in the 
formation of complex structures like ribosomes, viral coat proteins, and filaments, and in 

1 5 antigen-antibody interactions. Intracellular assays for detection of protein interactions 
and identification of their inhibitors have received wide attention with the completion of 
the human genome sequence. 

US Patent 5,585,245 discloses a first fiisibn protem conq>rising an N-temiinal 
subdomain of ubiquitin, fiised to a non-ubiquitin protein or peptide and a second fiision 

20 protein comprising a C-terminal subdomain of ubiquitin, fiised to the N-terminus of a 

non-ubiquitin protein or peptide. The patent discloses the use of these fiision proteins for 
studying protein-protein interactions.. When contacted A<ith dne another, provided that the 
non-ubiquitin proteins or peptides interact (bind) with one another, the N- and C-teiminal 
ubiquitin subdomaias associate to reconstitute a quasi-native ubiquitin moiety which is 

25 recognized and cleaved by ubiquitin-specific proteases. However, this assay requires the 
use of additional cellular factors, such as the ubiquitin-specific proteases, for detection of 
protein-protein interaction. Thus, this assay is not feasible for high throughput screening 
of cDNA libraries. 
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US Patent 5,362,625 discloses omega-acceptor and omega-donor polypeptides 
(comprising about two-lhirds and one-third of the p-galactosidase molecule amino and 
caiboxyl termini, respectively), prepared by recombinant DNA techniques, DNA 
synthesis, or chemical polypeptide synthesis techniques, which are capable of interacting 
5 to form an active enzyme complex having catalytic activity characteristic of P- 
galactosidase. The patent also describes the use ofthese polypeptides in enzyme 
complementation assays for quaUtative and quantitative detemiination of a suspected 
analyte in a sample. 

The yeast two-hybrid system for detecting protein-protein interactions in 
10 Saccharomyces cerevisiae (Fields and Song, 1989, Nature. 340:245-246; U.S. Pat. No. 
5,283,173 by Fields and Song) is well known in the art This assay utilizes flie 
reconstitution of a transcriptional activator like GAL4 (Johnston, 1987, Microbiol Rev., 
51 :458-476) tiirough the interaction of two protein domains that have been fused to the 
two functional units of the tianscriptional activator: the DNA-binding domain and the 
activation domain. This is possible due to the bipartite nature of certain transcription 
factors like GAL4. Being characterized as bq)arfite signifies that the DNA-binding and 
activation functions reside in separate domains and can function in trans (Keegan et al, 
1986, Science 231 :699-704). The reconstitution of flie transcriptional activator is 
monitored by flie activation of a reporter gene like flie lacZ gene tiiat is under flie 
influence of apromoter fliat contains abinding site- (Upstream Activating Sequence or 
UAS) for flie DNA-bmding domain of flie transcriptional activator. This method is most 
commonly used eiflier to detect an interaction between two known protems (Fields and 
Song, 1989, Nature. 340:245-246) or to identify interacting proteins from a population 
fliat would bind to a known protein (Durfee et al., 1993, Genes Dev.. 7:555-569; Gyuris et 
25 al.. 1993, Cell, 75:791-803; Harper et al. 1993, Cell, 75:805-816; Vojtek et al, 1993, 

Celh 74:205-214). Like flie ubiquitin system, additional factors are required for detection 
of flie protein-protein interaction. Additionally, in ttie yeast two-hybrid system, flie 
protein interaction must occur in the nucleus of the yeast 
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WO 98/34120 describes protein fragment complementation assays for detecting 
bimolecular interactions. The assays comprise coexpression effusion peptides consisting 
of N- and C-terminal fragments of murine DHFR frised to GCN4 leucine zipper 
sequences in E. coli to form colony. Colony formation only occurs when both DHFR 
5 fragments are present and contain leucine-zipper foraiing sequences. The published 
patent application contemplates the use of the assay to study molecular interactions 
including protein-protein, protein-DNA, protein-RNA, protein-carbohydrate, and protein- 
small molecule interactions, and for screening cDNA libraries for binding of a target 
protein with unknown proteins or libraries of small organic molecules for biological 
10 activity. WO 98/34120 also contemplates the use of GFP in the protein fragment 

complementation assay. However, the pubUshed patent application does not suggest 
fusing antiparallel leucine zipper to DHFR or GFP foir reconstitution. GCN4 disclosed in 
the pubUshed application and routinely used by skilled artisan to reassemble proteins 
especially in the yeast two hybrid system, is a parallel zipper. Antiparallel and parallel 
1 5 zippers orient proteins in opposite direction; thus, it is not predictable that an antiparallel 
zipper can be substituted for a parallel zipper. 

Additionally, all protein reassembly strategies disclosed in WO 98/34120 are for 
reassembly of multi domain proteins such as DHFR. The two dissected domains of 
DHFR can fold separately and only need to be brougjit into close proximity by attached 
20 proteins. There is no precedent for rational dissection of a single domain protein such as 
GFP that can be accomplished based upon the WO 98/34120.. WO 98/34120 does not 
teach how to rationally dissect single domain proteins that can be subsequently 
reassembled. Finally, the abiUty to identify and characterize appropriate sites for 
dissecting a single domain protein is not validated or demonstrated in WO 98/34120. 
25 US Patent 6,180,343 relates to the use of fluorescent proteins, particularly green 

fluorescent protein (GFP), in ftision constructs with random and defined peptides and 
peptide libraries, to increase the cellular expression levels, decrease the cellular 
catabolism, increase the conformational stability relative to linear peptides, and to 
increase the steady state concentrations of the random peptides and random peptide 
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library members expressed in cells for the purpose of detecting the presence of the 
peptides and screening random peptide libraries. The patent does not contemplate the use 
of antiparallel leucine zipper for reconstituting GFP nor the use of peptides that associate 
with each other to reconstitute GFP and to provide a detection signal. 

5 

SUMMARY OF THE INVENTION 

The present invention provides protein complexes comprising a first and second 
peptide, each of said peptides being joined, operably linked, or fused to a heterologous 
helical domain, said helical domains being noncovalently associated to form an 

10 antiparallel leucine zipper. The peptides of the protein complexes form a functional 
signaling moiety such as a reporter, a marker, or a biosensor upon non-covalent 
association of the helical domains into an antiparaUel leucine zipper. In one embodiment, 
each of the peptides is joined to a hehcal domain via a linker, hi a preferred embodiment, 
each of the heUcal domains comprises an amino acid sequence as set forth in SEQ ID NO: 

15 1 or SEQ ID NO: 2. Preferably, each of the first and second peptides comprises a distinct 
portion of green fluorescent protein (GFP). 

In one aspect, the present invention provides fusion proteins comprising a peptide 
and a hehcal domain, said helical domain forming an antiparallel leucine zipper when it 
noncovalently associates with a complementary hehcal domain. The helical domain is a 

20 heterologous or distinct protein or polypeptide frajgmenti relative to the peptide of flie 
fusion protein. The fusion protein may further comprise a linker moiety interposed 
between the peptide and tiie hehcal domain. In a preferred embodiment, the peptide 
comprises a peptide derived firom green fluorescent protem (GFP). 

.In another aspect, the present mvention provides nucleic acids encoding fusion 

25 protems comprismg a peptide and hehcal domain, said helical domam forming an 

antiparallel leucine zipper when it noncovalently associates with a complementary hehcal 
domain. 

The present invention provides a method of assembhng a protem complex 
comprising (a) providing first and second hehcal domains that non-covalentiy associate 
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to fonn an antiparallel leucine zipper; (b) providing first and second peptides; (c) 
producing fusion proteins by separately fusing said first helical domain to said first 
peptide and said second helical domain to said second peptide; and, (d) allowing the 
fusion proteins to form a protein complex mediated by the non-covalent association of the 
5 first and second helical domains into an antiparallel leucine zipper. The first and second 
peptides are distinct peptides. Preferably, they are distinct peptides derived firom GFP, 
such that they comprise different GFP fragments. 

In one embodiment of the disclosed method of assembling a protein complex, the 
protein complex comprises a signaling moiety and the helical domains comprise a leucine 
10 rich hydrophobic core. The helical domains may further comprise acidic residues and 
basic residues. The helical domains may further comprise a buried asparagine residue. 
The pair of helical domains preferably have the amino acid sequences as set forth in SEQ 
ID NO: 1 and SEQ ID NO: 2. In an alternative embodiment of the method, the step of 
producing the fusion proteins further comprises interposing a linker moiety between die 
15 peptide and the helical domain. 

The present invention also provides a method of identifying a polypeptide that 
interacts with a known polypeptide comprising (a) producing a first fusion protein 
comprismg the known polypeptide linked to a first GFP firagmen^ (b) produdng a second 
fusion protein comprising a test polypeptide linked to a second GFP fragment, wherein 
20 association of the first and second GFP fragments results m a GFP that exhibits detectable 
fluorescence; (c) allowing the first fUsion protein to associate with the second fusion 
protem to form a complex mediated by the non-covialent association of the known 
polypeptide and test polypeptide; and, (d) detecting whether, or to what extent, 
association of first and second GFP fragments occcurs, wherein association of GFP 
25 indicates that the test polypeptide interacts with the known polypeptide. Preferably, the 
first GFP peptide is NGFP and the second GFP peptide is CGFP. 

In one aspect, the present invention provides a method of identifying a 
polypeptide that interacts with a known polypeptide comprising (a) producing a nucleic 
acid encoding a fusion protein comprising the known polypeptide linked to a first GFP 
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fragment; (b) producing a plurality of nucleic acids encoding fusion proteins comprising a 
test polypeptide linked to a second GFP fragment, wherein association of the first and 
second GFP fragments results in a GFP that exhibits detectable fluorescence; (c) 
cotransforming or cotransfecting the nucleic acids of steps (a) and (b) into a host cell for 
5 expression of the encoded fusion proteins; (d) selecting colonies that exhibit fluorescence; 
and, (e) culturing the selected colonies to identify the test polypeptides that interact with 
the known polypeptide. 

In a preferred embodiment of the constructs and methods of the present invention, 
the first GFP peptide is NGFP and the second GFP peptide is CGFP. Also, preferably, 
1 0 the nucleic acids of step (b) of the foregoing identification step are produced in the form 
of a combinatorial library. 

In another aspect, the present invention provides a method of identifying a 
molecule that inhibits the activity of a known protein comprising (a) producing a first 

fusion protein comprising a first known polyp€5)tide linked to a first GFP fragment; (b) 
15 producing a second fusion protein comprising a second polypeptide linked to a second 
GFP fragment, wherein the second polypeptide is known to interact with the first 
polypeptide and wherein association of the first and second GFP fragments results in a 
GFP that exhibits detectable fluorescence; (c) allowing the first fusion protein to associate 
with the second fusion protein to form a GFP complex mediated by the non-covalent 
20 association of the first and second polypeptide; (d) incubating a test molecule with the 
GFP complex; and, (e) detecting disassembly of the complex, wherein disassembly of the 
complex indicates that the test molecule inhibits the activity of the known protein. 
Preferably, the first GFP peptide is NGFP and the second GFP peptide is CGFP. 

The present invention also contemplates a method of detecting protein-protein 
25 interactions comprising (a) producing a first fiision protein comprising a known 
polypeptide linked to a first GFP fragment; (b) producing a second fiision protein 
comprising a test polypeptide linked to a second GFP fragment, wherein association of 
the first and second GFP Augments results in a GFP that exhibits detectable fluorescence; 
(c) allowing the first fusion protein to associate with the second fusion protein to fonn a 
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complex mediated by the non-covalent association of the known polypeptide and test 
polypeptide; and, (d) detecting reassembly of GFP, wh^ein reassembly of GFP indicates 
that the test polypeptide interacts with the known polypeptide. 

A related method may further comprise obtaining nucleic acids encoding the first 
and second fusion proteins and cotransfecting or cotransforming the nucleic acids into a 
cell to obtain the first and second fusion protein. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows the strategy for antiparallel leucine zipper directed protein 
reassembly of GFP (ECraulis, P. J., 1991, J. AppL Crystallog., 24, 946-950). Both the 
ribbon and topographical stmctures are depicted: GFP is shown in green, NZGFP is 
shown in blue and CZGFP is shown in red. The sequences of the designed leucine 
zippers, NZ and CZ, are ALKKELQANKKELAOLKWELQALKKELAQ (SEQ ID NO: 
1) and EQLEKKLQALEKKLAQLEWKNQALEEOQAQ (SEQ ID NO: 2) respectively. 

Figure 2 shows fluorescence binding isotherm for the interaction of NZGFP with 
CZGFP monitored at 505 nm. Inset shows the normalized fluorescence excitation and 
emission of the reconstituted NZGFP.CZGFP complex. 

Figure 3 shows in vitro reconstitution of GFP demoristrated by (a) green 
fluorescent BL21(DE3) cells and the corresponding SDS gels of (b) lanel: MW markers; 
lane 2: protein from cotransformed green colony; and lane 3: protein from colony 
containing only NZGFP plasmid and (c) lanel: MW markers; protein from cotransformed 
green colony; and lane 3: protein from colony containing only CZGFP plasmid. 

Figure 4 shows the antiparallel leucine zipper pairs attached to CGFP and NGFP 
are shown in heUcal wheel representations. The Lys (K) residues are colored blue and 
complementary Glu (E) residues are colored red. The pairs a and b are electrostatically 
matched and the pairs c and d are electrostatically mismatched. The inset shows restreaks 
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of single Escherichia coli colonies corresponding to each pair. EK-CGFP is the same as 
CZGFP, and EK-NGFP is the same as NZGFP. 

Figures 5 A-C show fluorescence based selection. A. The "prey" leucine zipper 
attached to CGFP is randomized (X) at the e and g positions of the helix with either Lys 
(K) or Glu (E). and the "bait" leucine zipper attached to NGFP contains only Glu (E) 
residues at both e and g positions. B. SDS gel showing protein expression profiles of 10 
cotransformed green fluorescent colonies from the selection. C. Tabulation of the 
residues selected in the Ubrary leucine zipper (XX-CGFP) by screenmg for fluorescence 
of cotransfoimed Escherichia coli cells. 

Figure 6 shows fluorescence spectra for the dominant negative inhibition of 
NGFP/CGFP complex formation as a function of added EK. leucine zipper inhibitor (SEQ 
ID NO: 1). hiset shows the inhibition profiles of the EK leucme zipper, NGFP, and 
CGFP peptides. 

DETAILED DESCRIPTION 

1. General Description 

The present invention is based on the finding that the dissection and subsequent 
reassembly of a protein firom peptidic fiagments provides an avenue for controlling the 
protein's tertiary structure and hence its function. . 

The present invention is based in part on the surprising discovery of a general 
method for the reassembly of protein firagmehts mediated by the non-covalent association 
of antiparallel leucine zippers (Lupas, A., 1996, Trends Biochenu Sc. 21, 375-382; Kohn, 
W. D. etal., 1997, J. Biol Chem. 272, 2583-2586; Biyson, J: W. etal. 1995. Science, 
270, 935-941). Specifically, flie present mvention discloses a strategy for the noncovalent 
reconnection of the N- and C- termini of a dissected surface loop of a protein by means of 
antiparallel leucine zippers (Figure 1) (Krauhs, P. J., 1991, J. Appl. Crystallog., 24, 946- 
950). The present invention demonstrates the successful appUcation of this 
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oligomerization strategy, both in vitro and in vivo, to the 238 residue green fluorescent 
protein (GFP) from Aeguorea victoria (Tsien, R. Y., 1998, Annu. Rev. Biochem., 67, 
509-544). GFP provides an easily testable system for correct reassembly by virtue of its 
autocatalytically generated fluorescence, which is intimately linked to its properly folded 
5 structure (Ormo, M. et al, 1996, Science, 273, 1392-1395; Reid, B. G. et al. 1997, 
Biochemistry, 36, 6786-6791; Miyawaki, A. et a/., 1997, Nature, 388, 882-887; 
Miesenbock, G. et al, 1998, Nature, 394, 192-195;). 

The present invention is alscbased in pgrt on tlie discovery of an effective 
strategy involving linking fragments of an enzyme to potentially interacting protein- 
ic partners such that functional enzyme reassembly only occvirs on formation of a strong 

protein-protein complex. In one aspect, the present invention establishes the selectivity of 
the GFP reassembly mediated selection of interactiiig protems (GRIP) assay and applies it 
to the in vivo colorimetric selection of complementary leucine zipper pairs from 
combinatorial libraries in Escherichia colL In another aspect, the present invention 
15 demonstrates the ^plicability of the GRIP assay to monitor the dismption of protein- 
protein interactions by a dominant negative approach. Accordingly, the present invention 
provides an assay system that has the potential to moiiitor protein-protein interactions in 
their natural environment within a cell and are not limited to the nucleus as are classic 
yeast two-hybrid systems (Fields, S:et al, 1989, Nature, 245-246), 

20 

2. Definitions 

As used herein, "active protein complex" refers to a protein complex comprising 
two or more peptides and retaining substantially all the functional activity of the native 
protein from which the peptides are obtained. 
25 As used herein, "covalent bond'* refers to an interatomic bond characterized by 

sharing of electrons. 

As used herein, "fusion protein" or "chimeric protein" refers to a hybrid protein, 
which consists of two or more proteins, or fragments thereof, linked together covalently. 
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A fusion protein may comprise two or more peptides or proteins from different animals, 
origins, or species. 

As used herein, ''helical domain" refers to a protein or polypeptide fragment or a 
peptide having a a helix or a coiled configuration. 
5 As used herein, ''heterologous protein or peptide" refers to a protein or peptide 

derived fi'om a different origin, animal, or species. Heterologous proteins or peptides are 
not operably linked in their naturally occurring or native form. 

As used herein, "noncovalent association" refers to molecular interactions that do 
not involve an interatomic bond. Noncovalent interactions involve, for example, ionic 
10 bonds, hydrogen bonds, hydrophobic interactions, and van der Waals forces. 

Noncovalent forces may be used to hold separate polypeptide chains together in proteins 
or in protein complexes. 

As used herein, ''protein complex" refers to a combination of two or more proteins 
into a larger molecule without covalent bonding. 
15 As used herein, '^random peptide" refers to an oligomer composed of two or more 

amino acid residues and constructed by a means with which one does not preselect the 
complete sequence of a particular oligomer. 

As used herein, "random peptide Ubrary" or a '^combinatorial library* refers a 
library conaprising not only of a set of recombmant DNA vectors (also called 
20 recombinants) that encodes a set of random peptides, but also of random peptides 
encoded by those vectors, as well as the fusion proteins containing those random 
peptides. 

As used herein, "signaling moiety" refers to a moiety that acts to cause an action 
such as a signal. The moiety may signal as a result of an enzymatic reaction, Ught 
25 absoiption, or other means. 

3. Specific Embodiments 

A, CtFP a System for Protein Reassemblv and Frair^ f^j ^ t Comp lementatin^ 

Assay 
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The present invention is based in part on the use of GFP as model for protein 
reassembly and fragment complementation based assays. GFP provides an ideal system 
for these assays because the reassembled protein autofluoresces and is easily visualized 
and amenable to fluorescence activated cell sorting (Tsien, R. Y., 1998, Annu. Rev, 

5 Biochem. 67, 509-544; Misteli, T. et al, 1997, Nat BiotechnoL 15, 961-964). GFP 
fluorescence does not require the addition of other cellular factors, substrates, or 
additional gene products from A. victoria. Moreover, GFP can be expressed and detected 
in various cells and organisms and is not localized to a specific organelle of a cell upon 
expression. Additionally, unlike the DHFR assay, detection of GFP expression is not 

10 dependent upon survival or death of host cells. Nor is the e^qpression of GFP dependent 
upon the addition of cofactors as in the P-galactoside assay or of other cellular 
components as in the ubiquitin assay. It is also not toxic to mammals and has been 
expressed in monkeys (Chan et al, 2001, Science^ 291, 309). Further, the multiple 
variants of GFP available for use in different organisms and cell-types makes it an ideal 

15 protein candidate for development of a general assay such as the GRIP assay described 
below. 

Various mutations in GFP leading to brighter emission following 488 nm 
excitation have been generated. Mutations in GFP which'shift the excitation maximum 
from 395 nm to about 490 nm have been reported by Delagrave et aL (1995, 

20 Biotechnology, 13,151) and Heim et aL (1995, Nature, 373, 663). Mutants with Ala, 
Gly, lie, Cys or Thr substituted for Ser65 have large shifts in excitation maxima, and 
fluoresce more intensely than wild-type protein when excited at 488 nm. The mutation of 
Ser65 to Thr or Cys has been observed to increase'by a factor of 6 the fluorescence of 
GFP following 488 nm excitation. Heim e/. a/. (1994, Proc. Nat'lAcad. Sci USA, 91, 

25 12501-12504) describe a mutant that fluoresces blue and contains a histidine in place of 
Tyr66. Delagrave et aL (1995, Bio. Technology, 13, 151-154) report on several ^e^i/orea 
GFP variants that showed red-shifted excitation spectra, shift in excitation maxima 
from 393 nm to 498 nm. Delagrave et aL hypothesize that co-expression of GFP and red- 
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shifted GFP (RSGFP) will enable the analysis of two proteins or promoters per cell or 
organism. 

U.S. Patent 6,096,865 describes GFP mutants with improved solubility properties 
at higher temperatures and are able to fluoresce at 37 ®C. Specifically, the patent 
5 provides a GFP mutant in which phenylalanine at original amino acid position 64 is 
replaced by a leucine. This mutant has the ability to fluoresce at 37 °C. Other mutants 
with altered spectra are disclosed by Heim et aL (1994, Proc. Nat'l Acad. Sci USA, 91, 
12501-12504 and 1995, Nature, 373, 663). 

The present invention contemplates the use of various GFP mutants in the 
10 protein complementation assay and protein reassembly assay described in detail below. 
The preferred GFP mutant is the sglOO GFP variant described below. 

B. Methods for Reassembly of Fragments intn a Fi m ctional Prntrin 

The present invention also is based in part on lbs discovery that an antiparallel 

15 leucine zipper is usefiil for in vitro reassembly of protein fragments into a functionally 

active protein. SpecificaUy, a GFP variant (sglOD) which has a single excitation and 

ranission maximum at 475 nm and 505 mn respectively, was dissected and refolded using 

an antiparallel leucine zipper. The GFP variant, sglOO, was dissected at a surface loop 

between residues 157 and 158. Apairof heUces, NZ andCZ (SEQ ID NO: 1 and 2), 

20 capable of forming an antiparallel leucine zipper was designed and fused to the dissected 

GFP fragments via linkers to form NZGFP (N-terminal GFP) and CZGFP (C-terminal). 

Under conditions routinely used for folding denatorefd GFP, NZGFP and CZGFP 

reassembled properly to form a functi -i pally active GFP/ The wavelengths, X^, for 

fluorescence excitation and emissio! . were identical to that ofthe parent GFP 
25 (Figure 2). 

The present invention is als • » *. on the discovery that an antiparallel 

leucine zipper is usefixl for in vivo r otein fragments into a fiinctionaUy 

active protein. Specifically, equimc irimids encoding NZGFP and CZGFP 

were transformed into E. coli cells. . . A green (Figure 3, panel a) were 
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selected and further cultured in liquid media for analysis of the protein expression pattern. 
As shown in figure 3, panels b and c, the green colonies expressed similar amounts of 
NZGFP and CZGFP, whereas the non-fluorescent colonies contained either NZGFP or 
CZGFP. Moreover, control cotransfonnation experiments withNGFP/CGFP, 
5 NGFP/CAFP, and NZFP/CGFP did not have any green colonies. Accordingly, the 
presence of both NZ and CZ leucine zippers are required to mediate GFP assembly in 
vivo and in vitro. 

The present invention contemplates the use of the antiparallel leucine zipper to 
refold, reconstitute, or reassemble proteins from peptides. Moreover, the abihty to 

10 reconstitute GFP firom its peptide fragments can be extended to an in vivo fragment 
complementation assay for the selection of antiparallel leucine zippers as has been 
demonstrated for parallel leucine zippers with DHFR (Pelletier, J. N. et aly 1999, Nat 
BiotechnoL, 17, 683-690). As described below, fragmented GFP can be used to study the 
in vivo interaction of protein-protein pairs which have their N and C termini in close 

15 proximity (Pelletier, J. N. et aL, 1998, Proc. Natl Acad. Set U.SA., 95, 12141-12146). 
More generally, the protein reassembly strategy of the present invention may have 
applications such as the selective isotopic labeling of one fragment of a large protein for 
NMR analysis, or the mutagenesis of a limited region of a protein as demonstrated for 
inteins (Cotton, G. J. et al^ 1999, J: Am. Chem. Soc, 121, 1100-1101; Cotton, G. J. et al., 

20 1999, Chem. Biol., 6, R247-R256; Muir, T. W. et al., 1998, Proc. Natl. Acad. Sci. U.S.A., 
95, 6705-6710; Xu, R. et al, 1999, Proc. Natl. Acad. Sci: U.S.A., 96, 388-393). Further, 
the engineering of an on/off switch for tiie acti\dty of .fragmented proteins by designing a 
leucine zipper heterodimer which can be reversibly 'assembled or disassembled by 
controlling the environmental conditions is also contemplated (Zutshi R. et al, 1998, 

25 Curr. Opin. Chem. Biol, 2, 62-66; Yao, S. et al, 1998, Nature, 396, 447-450; Krylov, D. 
et al, 1994, EMBOJ.. 13, 2849-2861). 
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C. GRIP Assay and Combinatorial SftWt^r^n 

The present invration is also based on the selectivity of the GFP reassembly 
mediated selection of interaction proteins. Based on this selectivity, the present invention 
developed the GRIP assay (GFP reassembly mediated selection of interacting proteins or 
peptides) and applied the assay to the in vivo colorimetric selection of complementary 
leucine zipper pairs from combinatorial libraries. 

Specifically, the inventors having established that the GRIP assay was selective 
for high afiBnity LZ (leucine zipper) pairs, tested the appHcability of the assay in the 
combinatorial selection of LZ pairs that would interact strongly enough to promote GFP 
reassembly (Figure 5A). This would extend the GRIP system for selection of protein 
partners as had been demonstrated for other fragment reassembly systems (Pelletier, J.N. 
et al, 1999, Nat. Biotechnol. 17, 683-690). A simple experiment in which the acidic LZ 
containing N-terminal GFP fragment (EE-NGFP) was kept constant was chosen. A 
library of LZ partners that could either code for Glu or Lys with equal probability at the e 
and g "specificity" positions (Figure 5A) was geherated. This Ubraiy was fused to the C- 
terminal GFP fragment (XX-CGFP). The plasmid encoded Ubrary of XX-CGFP and EE- 
NGFP were cotransformed into host cells, and colonies that exhibited fluorescence were 
selected and analyzed by sequencing. As expected, there was an overall enrichment of 
Lys residues as the selected pamer for complemfentihg the acidic EE-NGFP, The 
electrostatic pairing of Lys/Glu is required for stabilizing the leucine zipper. 

The present invention demonstrates that the GRIP assay is selective for specific 
protein pairs in vivo and is amenable for the selection of eomplfementary protein pairs in 



VIVO. 



p. GRIP Assay and it?? TTse in Petection of InhihUnrs or P rotein-Prnte^^ 
Interactions ; 

The present invention is further based in part on the discovery that the GRIP assay 
is useful for assaying the disruption of protein-protein interactions in vitro. The GRIP 
can be utiUzed for identifying inhibitors of protein-protein interactions. SpecificaUy, a 
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LZ peptide (SEQ ID NO: 1) was incubated with NGFP/CGFP complex. The sample was 
monitored for fluorescence as a function of added peptide (Figure 6). The LZ peptide 
(SEQ ID NO: 1) prevented the assembly of the complex (4 \xM) with an IC50 value of 31 
|aM. Control experiments with addition of either NGFP or GGFP fragments that lacked 
5 leucine zippers did not prevent reassembly of NZGFP/CZGFP complex (Figure 6). 

P.. Applicati ons of the GRIP Assay 

The present invention is based on the development of a visually detectable 
colorimetric system for studying the assembly and disassembly of protein partners. This 
10 system can be used for high-througlq)Ut screening, for example, screening using 

fluorescence activated cell sorting in yeast (Winson,.M. K. et al^ 2000, Methods, 21, 231- 
240 (2000).). Further, the system can be practiced using protein three-hybrid detection 
system, with two interacting proteixis fiised to respective fragments of a donor GFP 
variant and a third protein fused to an acceptor GFP variant, thus allowing for in vivo 
1 5 fluorescence resonance energy transfer measurements (Tsien, R. Y., 1998, Annu. Rev. 
Biochem., 67, 509-544; PoUok, B. A. et al, 1999, Trends CellBioL, 9, 57-60). 

In the GRIP assay, the emitted light can be analyzed by visual screening, a flow 
sorter (FACS), a spectrophotometer, a nodcrotiter plate reader, a charge coupled devise 
(CCD) array, a fluorescence microscope, or other similar devices. 
20 The GRIP assay may be performed hi using a multiwell format. Typically, wells 

are arranged in two dimensional linear arrays with greater than 864 wells on a standard 
microtiter plate footprint. Other commonly used nunibers of wells include 1536, 3456, 
and 9600. Well volimies typically vary from 500 rianoliters to over 200 microUters, 
depending on well depth and cross sectional area. Well volumes of 1, 2, 5, 10, 20, and 50 
2.5 microliters are commonly used. Wells can be made in any cross sectional shape (in plan 
view) including, square, round, and hexagonal and combinations thereof Wells can be 
made in any cross sectional shape (in vertical view), including shear vertical walls with 
flat or round bottoms, conical walls with flat or roimd bottoms and curved vertical walls 
with flat or round bottoms and combinations thereof. 
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US Patent 6,229,603 provides multi-well plates with greater than 864 wells that 
comprise a layer of cycloolefin having low fluorescence and high transmittance. These 
multi-well plates are particularly well suited for fluorescence measurements. 

The GRIP assay may be used to study protein-small molecule interactions. 
5 Alternatively, the assay may be used to investigate protein-protein interactions and to 
screen libraries for identification of binding molecules. Examples of protein-protein 
interactions include, but are not limited to, antigen/antibody, ligand/receptor, antagonist 
or inhibitor/protein, binding protein/protein, and enzyme/substrate. 

Further, the GRIP assay may be used to investigate other macromolecular 
10 interactions. A known DNA or RNA binding protein, "A" (that binds a RNA or DNA 
sequence "X"), is fused to one firagment of GFP, for example NGFP, and a second 
putative RNA or DNA binding protein fiom library "Z" is fused to, for example CGFP. 
In an in vivo or in vitro system the DNA or RNA component ('X-Y") that is being 
assayed for will have the DNA or RNA sequence "X" attached to a second DNA or RNA 
15 sequence Y whose protein target is being sought" fiom library "Z". Whea binding or 
complexing occurs between "X** and A-NGFP and "Y" with a protein fiom the X-CGFP 
library, fluorescence will be established. 

Variations upon this can be used to identify carbohydrate-protein partners or small 
molecule protein partners by making appropriate changes in the NGFP fused protein A 
20 (which can be chosen to bind carbohydrate or small iholecule components). 

This assay may also be used lo investigate Ubraries of DNA, RNA, carbohydrates, 
peptides or other small molecules. In this situation "X-Y" can be a Ubrary. "X" is held 
constant with a known DNA, RNA, carbohydrate, or small molecule that binds a protein, 
"A", and •'Y" can be varied as desired. The fusion proteins A-NGFP and Z-CGFP can 
25 also be held constant. "Y*' is to identified and is a molecule that binds Z-CGFP. 

Establishing fluorescence will indicate identification of a DNA, RNA, carbohydrates, or 
small molecules component Y that binds protein Z. 

F. Combinatorial T.ibraries 
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A combinatorial chemical library is a collection of diverse chemical compounds 
generated by eitfier chemical synthesis or biological synthesis, by combining a number of 
chemical "building blocks" such as reagents. For example, a linear combinatorial 
chemical library such as a polypeptide Ubrary is formed by combining a set of chemical 
5 building blocks (amino acids) in every possible way for a given compound length (i.e., 
the number of amino acids in a polypeptide compoimd). Millions of chemical compoxmds 
can be synthesized through such combinatorial mixing of chemical building blocks. 

Preparation and screening of combinatorial chemical libraries is well known to 
persons of skill in the art. Such combinatorial chemical libraries include, but are not 
10 lunited to, peptide Ubraries (see, e.g., U.S. Pat. No. 5,010,175, Furka, 1991, Int. J. Pept 
Prot Res. 31 y 487) and Houghton et aL, 1991, Nature 354, 84). Other chemistries for 
generating chemical diversity libraries can also be used. Such chemistries include, but 
are not limited to: peptoids (PCT PubUcation No. WO 91/19735), encoded peptides (PCX 
PubUcation WO 93/20242), random bio-ohgomers (PCT Publication No. WO 92/00091), 
1 5 benzodiazepines (U.S. Pat. No. 5,288,5 14), diversomers 'such as hydantoins, 

benzodiazepines and dipeptides ( Hobbs et al, 1993, Proc. Nat. Acad. Sci. USA 90, 6909), 
vinylogous polypeptides (Hagihara et al.^ 1992, /. Amer. Chem, Soc. 1 14, 6568), 
nonpeptidal peptidomimetics with P-D-glucose scaffolding (Hirschmann et ai, 1992, J. 
Amer. Chem. Soc. 114, 9217), analogous organic syntheses of small compound libraries 
20 (Chen et aL, 1994, J. Amer. Chem, Soc. 116, 2661), oligocarbamates (Cho et aU 1993, 
Science 261, 1303), and/or peptidyl phosphonates (Campbell et aL, 1994, J. Org. Chem. 
59, 658), nucleic acid libraries, peptide nucleic ^cid libraries (U.S. Pat. No. 5,539,083), 
antibody libraries (Vaughn et al, 1996, Nature Biotechnology 14(3), 309 and 
PCT/US96/10287), carbohydrate libraries (Liang et aL, 1996, Science 274, 1520 and U.S. 
25 Pat. No. 5,593,853), small organic molecule libraries' (benzodiazepines, Baum, C&EN 
January 18, page 33 (1993); isoprenoids, U.S. Pat. No. 5,569,588; thiazolidinones and 
metathiazanones, U.S. Pat. No. 5,549,974; pyrrolidines, U.S. Pat. Nos. 5,525,735 and 
5,519,134; morpholino compounds, U.S. Pat No. 5,506,337; benzodiazepmes, U.S. Pat. 
No. 5,288,514, and the like). 
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Devices for the preparation of combinatorial libraries are conimercially available 
(see, e.g., 357 MPS, 390 MPS, Advanced Chem Tech, LouisviUe Ky., Symphony, Rainin, 
Wobum, Mass., 433A Applied Biosystems, Foster City, Calif., 9050 Plus Millipore, 
Bedford, Mass.). In addition, numerous combinatorial libraries are themselves 
5 commercially available (see, e.g„ ComGenex, Princeton, NJ., Asinex, Moscow, Ru, 
Tripos, Inc., St. Louis, Mo., ChemStar, Ltd, Moscow, RU, 3D Pharmaceuticals, Exton, 
. Pa., Martek Biosciences, Colimibia, Md, eta). 

The small molecules of a small molecule combinatorial hbrary may be selected 
from at least one of the group consisting of amino acids, peptides, oligonucleotides, and 
10 heterocyclic compounds. The present invention contemplates contemplates combinatorial 
Ubraries of small molecules that are naturally occurring or synthetic. 

Suitable peptides comprise as few as two amino acids to as many as about 30; 
preferably, suitable peptides comprise from about two amino acids to about fifteen; most 
preferably, suitable peptides comprise from about two amino acids to about ten. Any 
1 5 amino acid may be incorporated into peptides screened and identified using the present 
invention, including any combination of the naturally occurring proteinogenic amino 
acids as well as amino acids not naturally occurring in proteins such as, but not limited to, 
dextrorotatory forms of the known amino acids, for example. 

Suitable oligonucleotides consist of as few as* two nucleotides to as many as about 
20 50; preferably, suitable oligonucleotides consist of fix>m about five nucleotides to about 
30; most preferably, suitable oligonucleotides consist of from about five oligonucleotides 
to about 1 5. Any nucleotide may be incorporated into an oligonucleotide to be screened 
and identified using the present invention, including any combination of the naturally 
occurring deoxyribonucleotides and ribonucleotides as well as those not naturally 
25 occurring in biological systems, such as, but not limited to, H-phosphonate derivatives, 
N-blocked-5'-0-DMT-deoxynucleoside3'-(2-cyanoethyl-N,N- 

diisopropyl)phosphoramidites, N-blocked-5*-0-DMT-deoxynucleoside 3'-(2-cyanoethyl« 
N,N-diisopropyl)phosphoramidites, N-blocked-5'-0-DMT-deoxynucleoside 3'.(methyl- 
N,N-diisopropyl) phosphoramidites, N-blocked-5'-0-DMT-deoxynucleoside 3'-(2. 
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chlorophenyl) phosphates, N-blocked-5 -O-DMT-deoxynucleoside 3'-(2-chlorophenyl 2- 
cyanoethyl) phosphate, all of which are nucleoside derivatives used in oUgonucleotide 
synthesis. 

Suitable heterocyclic compounds consist of^ at minimum, a single four membered 
5 ring to as much as a multiple of four membered or greater membered rings coupled by 
carbon chains of 1 to about 20 atoms in length, such chains being saturated or not. 
Preferably, suitable heterocyclic compounds include a single four- to seven-membered 
ring, as well as, but not limited to varying combinations of 5, 6, or 7 membered rings 
having varying numbers of N, S, or O atoms. Examples of suitable heterocyclic 
10 compounds include benzodiazepine and derivatives thereof (as, for example, disclosed in 
Bunin et al.^ 1992, J, Am. Chem. Soc. 114, 10997),.penicillins, cephalosporins, and folate 
derivatives. 

For ease of identification, the molecules in a small niolecule combinatorial library 
may be tagged for decoding their identity. 

15 The GRIP assay may be used to screen mixed libraries. Mixed libraries of small 

molecules comprising amino acids, peptides, oligonucleotides, and heterocyclic 
compoimds that are 5'-hydroxyl derivatives of the oligonucleotides may be used. The 
peptide end of members of a peptide library can be modified to include a carboxyl group. 
A process of esterification of the carboxyl group -with the 5'-hydroxyl of the 

20 oligonucleotide is used to produce a mixed library containing peptide-oligonucleotide 
species. Brermer et aLj (1992, Proc. Nat'l Acad. ScL USA 89, 5381) also describes a 
method of preparation of mixed libraries having nucleotides and peptides. A mixed 
library comprising a heterocyclic compound and a peptide is also prepared by the reaction 
of suitable fimctional groups present on the heterocydic compound. For instance, the 

25 carboxyl group on a heterocyclic compound is reacted with the amino group on the 
peptide to provide an amide linkage. 

Preferably, the GRIP assay of the present invention is used to screen peptide 
libraries. A comprehensive review of various types of peptide libraries can be found in 
Gallop et aL, 1994, J. Med. Chem. 37:1233-1251. The use of peptide libraries is well 
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known in the art. Peptide libraries have generally been constructed by one of two 
^proaches. 

Li the first approach, peptides have been chemically synthesized in vitro in several 
formats. For example, Fodor et al. (1991, Science 251. 767) describe use of complex 
5 instrumentation, photochemistry and computerized inventory control to synthesize a 

known array of short peptides on an individxial microscopic sUde. Houghten et a/. (1991, 
Nature, 354, 84) describe mixtures of free hexapeptides in which the first and second 
residues in each peptide were individually and specifically defined. Lam et al. (1991, 
Nature 354, 82) describe a "one bead, one pq)tide" approach in which a soUd phase split 

10 synthesis scheme produced a library of peptides in which each bead in the collection had 
immobilized thereon a single, random sequence of amino acid residues. 

hi the second approach, peptides are expressed m biological systems as either 
soluble fiision protems or viral cqisid fiision proteins. A number of peptide libraries have 
been generated using the M13 phage. M13 is a filameatous bacteriophage that has been 

1 5 routinely used in molecular biology laboratories for the past 20 years. M13 viral particles 
consist of six different capsid proteins and one copy of the viral genome, as a single- 
stranded circukuT DNA molecule. Once flhe M13 DNA has been mtroduced into a host 
cell such as E. coli, it is converted mto double-stranded, circular DNA. The viral DNA 
carries a second origin of replicatioii th^ is used to generate the single-stranded DNA 

20 found in the viral particles. During viral morphogenesis, there is an ordered assembly of 
the single-stranded DNA and flie viral proteins, and the viral particles are extruded fiom 
cells in a process much hke secretion. The M13 vuiis is ndther lysogenic nor lytic like 
other bacteriophage (e.^., X); cells, once infected, chronically release virus. This feature 
leads to high titers of virus in infected cultures, i.e., lO'^ pfii/ml. 

25 In a preferred embodiment, a GFP peptide comprising a fragment of GFP is fused 

to a random peptide to form a fusion polypeptide. By "fused" or "operably linked" herein 
is meant that the random peptide and the GFP, are linked together, in such a manner as to 
mmimize the disruption to the stabiUty of the GFP structure, /.& it retains fluorescence. 
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The GFP fusion polypeptide of the present invention can comprise further components 
such as linkers or fusion partners. 

The peptides (and nucleic acids encoding them) are randomized, either fiiUy 
randomized or they are biased in their randomization, e.g. in nucleotide/residue frequency 
5 generally or per position. As used herein "fiiUy randomized** means that each nucleic 

acid and peptide consists of essentially random nucleotides and amino acids, respectively. 
The nucleic acids which give rise to the peptides are chemically synthesized, and thus 
may incorporate any nucleotide at any position. Thus, when the nucleic acids are 
expressed to form peptides, any amino acid residue may be incorporated at any position. 
10 The synthetic process can be designed to generate randomized nucleic acids, to allow the 
formation of all or most of the possible combinations over the length of the nucleic acid, 
thus forming a library of randoiiiized nucleic acids. 

Alternatively, the peptide library is biased. In this case, some positions within the 
sequence are either held constant, or are selected from a limited number of possibilities. 
1 5 Individual residues may be fixed in the random peptide sequence to create a stractural 
bias. For example, proline or bulky residues such as W, R, K, L, I, V, F or Y may be 
inserted to restrict the conformation of flie peptide. Also; the library can be biased to a 
particular secondary stmcture such as the alpha^^helical stmcture. Examples of helix 
forming residues include M, A, K, L, D, E, R, Q, and V, 
20 In a preferred embodiment, the bias is toward peptides that interact with the 

known classes of molecules. For example, it is known that SH-3 peptides binds to SH-3 
proteins. A large number of small molecule domains are known that are suitable as 
starting points for the generation of biased randomized peptides. Examples of such 
molecules, domains, or consensus seqi^ences include, but are not limited to SH-2 
25 domains, SH-3 domains, pleckstrin, deatti domains, protease cleavage/recognition sites, 
enzyme inhibitors, enzyme substrates, and Traf., and leucine zipper consensus sequence. 

As discussed above, a fusion partner or lihker can be added to fuse the random 
peptides to a GFP peptide. Fusion partners or linkers can be synthetic or heterologous 
(not native to the host cell). Appropriate fusion partners include, but are not limited to 
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peptides that are stability sequences that stabilize and protect the random peptide from 
degradation, linker sequences for decoupling the random peptide from the GFP fragment, 
structural sequences that restrict and stabilize the conformation of the random peptide, 
targeting sequences which allow localization of the peptide into a subcellular or 
5 extracellular compartment, and rescue sequences that allow the purification or isolation of 
the random peptide. 

In light of the foregoing general discussion, the specific examples presented below 
are illustrative only and are not intended to limit the scope of the invention. Other 
10 generic and specific configurations will be ^parent to those pereons skilled in the art. 

EXAMPLES 
General Materials and Methods 
GFP Variant: A variant of the naturally occurriiig GFP, which has a single excitation 
15 maximum at 475 nm was chosen for dissection and reassembly. The GFP variant (sglOO) 
contains F64L, S65C, Q80R, Y151L, I167T and K238N mutations fit)m wild type GFP, 
which leads to a single fluorescence excitation and emission maximum at 475 nm and 505 
mn respectively, similar to GFP-sg25 as described by Pahn, G. J et al, 1997, Nat. Struct. 
Biol., 4, 361-365. 

20 

Cloning and Purification Protocol for NZGFP, NGFP, CGFP, and CZGFP: The 
NZGFP, NGFP, CGFP and CZGFP coding DNA were obtained by PGR amplification of 
the GFP (sglOO) plasmid template using appropriate primers. The DNA fragments were 
cut with Nhel/BamHl and Ugated into the pETl 1 a vector. The DNA sequences of the 
25 NZGFP, NGFP, CGFP, and CZGFP containing clones were verified by 

dideoxyoUgonucleotide sequencing at the Keck faciUty at Yale. The protein products 
were overexpressed in BL21(DE3) cells at 37 °C without IPTG induction. The cells were 
lysed by sonication and the proteins were individually purified by passage over 2 
successive Q-sepharose columns and then over a Gel-filtration column. Fractions 
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containing the protein of interest, as determined by SiDS-P AGE, were pooled and 
dialyzed against 2niM DTT, 10 mM TrisHCl buffer at pH 7.2. Final purified yields of 
proteins were between 10-20 mgf L. Protein molecular weights were verified by MALDI 
mass spectrometry to within 0.05% of the calculated molecular weight. Amino acid 
5 analysis of the proteins established the correct compositions and protein concentrations 
for further biophysical studies. 

Amino acid sequences ofNGFP, NZGFP. CGFP. and CZGFP: Leucine zippars are in 
bold and linker regions unda:lined. Note the 6 residue linker between the C-tenninal of 
10 NGFP and NZ and the 4 residue linker betwerai CGFP and CZ. 

NGFP 

MASKGEEUTTGVWILVELDGDVNGHKFSVSGEGEGDATYGKLTI^ 
WWFIIAriTLCVrGVQCTSRYPDHMKiaroFFKSAMPEGYVQERTIFI^ 
15 P_AFVFFF^r>T^ AnMPTPT TrnTmnnpnmsiTT avnci .P.VNY>jmsrvT.TMADK O(?ay(?/?G 
(SEQIDNO:3) 

f^GFP 

MASKGEELFTGVWILVELDGDVNGHKFSVSGEGEGDATYGKLTLKFICTTGKLP 
20 WWFTLVTTLCYGVQCFSRYPDHMKRHDFFKSAMPEGYVQERTIFFKDDGl^^ 
p A T?WT7T?rtr>TT AmPTFT TT nTnFTCFDONTT GTTKT .P.YNYNHNVT.TM ADK QGG^G^G 
ALKKELQANKKELAQLKWELQALKKELAQ (SEQ ID NO: 4) 

CGFP 

25 MASi2i2SiZKNGIKVNFKIRHNIEDGSVQLAr)HYQQNTPIGDGPVLLPD 
SAI^KDPl^KIUDHMVLLEFVTAAGITHGMDELYN (SEQIDNO: 5) 

CZGFP 

MASEQLEiaa.QALEiaaAQO:WiaSQALEKia.AQJ2(iSSK^ 
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EDGSVQI^HYQQmPIGDGPVlJ-BDNH\^TQSALSKDPNEKia)H^ 
AAGrraGMDELYN (SEQ ID NO: 6) 

Additional Constructs/or GFP Reassembly: DNA constructs for EE-NGFP and KK- 
5 CZGFP coding DNA were obtained by PGR amplification of the GFP (sglOO) plasmid 
tenq>late using appropriate primers encoding the leucine zippers KK and EE whose 
sequences are AQLKEKLQALKEKLAQK WKLNALKEKLAQ (SEQ ID NO: 7) and 
ALEKELQANEKELAQLEWELQALEKELAQ (SEQ ID NO: 8) respectively. The 
DNA fragments were digested with Nhel/BamHl (New England Biolabs) and Ugated into 
10 the pETUa vector. The DNA sequences of the EE-NGFP, and JOC-CGFP containing 
clones were verified by automated sequencing at the Keck facility at Yale. 

Constructs for Libraty Selection: For leucine zipper library construction two overlapping 
degenerate oUgonucleotides containing NAG (N= G or A) at all positions correspondmg 

15 to Lys in the leucine zipper of KK-CGFP were Synthesized such that they would code for 
either Lys or Glu with equal probabiUty. The two overl^mg oUgonucleotides were 
mutually primed and extended using T7 Sequenase <AmereKam) with 10 mM dNTPs. 
The product was purified &om an agarose gel and subsequently ligated into flie Nhel- 
Dram ^ew England Biolabs) cassette present in a previously cut KK-CGFP plasmid. 

20 The resulting Kbrary. XX-CGFP, was transformed m 5 x 50 of electrocompetent 

XLl-Blue ceUs (Stratagene) and selected for ampicilliii resistance. The resulting pool of 
XX-CGFP plasmids was sequenced to verify that G/A were equally represented at sites of 
randomization. 

25 Colorometric Selection: For all reassembly experiments with NZGFP/CZGFP, 

NZGFP/KK-CGFP. EE-NGFP/CZGFP, and EE-NGFP/KK-CGFP: 1 ^g of each plasmid 
was cotransfoimed in 30 hL of BL21(DE3) cells and selected on ampicillin containing 
LB plates. The plates were incubated at 37 °C overnight and subsequently moved to the 
bench top (23 °C) for 2 day. The green color deyelpped after 16-32 hours. The 
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cotransfonnation efficiency was approximately 7 ± 2 % as yerified by growing up 
individual colonies and monitoring protein expression profiles, which corresponded well 
with visual inspection of green colonies in experiments with NZGFP/CZGFP and EE- 
NGFP/BCK-NGFP. Non-fluorescent colonies that coexpressed either NZGFP/KK-CGFP 

5 or EE-NGFP/CZGFP were identified by screening 120 colonies of respective 

cotransformations by SDS gel for protein expression of both gene products. In library 
selections^ 20 individual cotransformations of l|ig of XX-CGFP library plasmid with 1 
Jig of EE-NGFP plasmid were carried out as described above. Sixteen colonies were 
selected from 102 green colonies of -4000 total colonies. The colonies were grown 

10 overnight in LB media and the plasmid DNA CXX-CGF? + EE-NGFP) purified and 
sequenced using primers unique to the XX-CGFP construct. 

Inhibition of Protein-Protein Interactions: The protein products for NZGFP, CZGFP, 
NGFP and CGFP were overexpressed in BL21 (DE3) cells at 37 ""C and purified as 
described above. Amino acid analysis of the proteins estabUshed the correct compositions 

15 and protein concentrations for fluorescence experiments.. The inhibitor peptide 
corresponding to the leucine zipper of NZGFP (EK peptide) having the sequence 
ALKKELQANKKELAQLKWELQALKKELAQ (SEQ ID NO: 1) was synthesized at the 
Keck facility (Y ale University) and purified on a reverse phase C8 colunm(Vydac) by 
HPLC. Peptide concentrations were detemained by Tip absorbance and verified by ammo 

20 acid analysis. 

For inhibition experiments all fluorescence measurements were made in triplicate 
on a Hitachi F-4500 Fluorescence Spectrophotometer with excitation at 475 nm and 
emission at 505 nm. A 1.2 mM stock solution of equimolar amounts of NZGFP/CZGFP 
was allowed to reassemble and fluoresce until there was no change in fluorescence (36 
25 hours). The reassembled complex was denatured in 4 M GdmHCl for 4 hours following 
which dififerent concmtrations of EK peptide, NGFP, or CGFP were added and the 
NZGFP/CZGFP concentration adjusted to 800 14M. The samples were diluted 200 fold 
into 2 mM DTT, 10 mM Tris.HCl buflfer at pH 12 to a 20 mM final concentration of 
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GdmJICl and allowed to refold and fluoresce. In order to eliminate artifacts from time 
dependent inhibition, fluorescence measiurements were made after 4 hours and after 16 
hours and found to be constant. 

5 Example 1: In Vitro Reassembly of GFP Using an Antiparallel Leucine 

Zipper 

1 . Design of Antinarallel Leucine 7.ij>pe^ ' 

Designs for helices, designated NZ and CZ, to form antiparaUel leucine zippers 
for reassembly purposes were based upon sequences reported by Hodges, (1 la) Kim, 
10 (Oshea. E. K. et aJ., 1993, Current Biol, 3, 658-667) and Alber (Haibuiy, P. B. et al., 
1994, Nature, 371, 80-84). The leucine zippers contained a Leu-rich hydrophobic core, 
acidic (GIu) and basic (Lys) residues to direct antiparallel heteiodimer formation, and 
also incorporated a buried a^aragine residue which disfavors homodimerization by up to 
2.3 Kcal/mol (Figure 1) (Oakley M. G. etal.; 1998, Biochemistry, 37, 12603-12610). 

15 

2. Dissection of OFP 

The variant GFP (sglOO) was dissected at a surface loop between residues 157 and 
158, a position that has previously been shown to accommodate a 20 residue amino acid 
insertion (Abedi, M. R., et al. , 1 998, Nucleic Acid Res., 26, 623-630). The dissection 
20 resulted in N- and C-tenninal fi-agments, designated NGFP and CGFP, containing 157 

and 81 residues, respectively (Figure 1). The NGFP fragment contains the three residues, 
Ser65, Tyr66, and Gly67, that ultimately form the GFP .fluorophore (Tsien, R. Y., 1998, 
Annu. Rev. Biochem., 67, 509-544). 

25 3, /» Vitro Reassemblv of the Dissected nv p Fragments TTsing the Di-rig m^ 

Helices 

The designed heUx, NZ was upended to the C-tenninal of NGFP, via a six 
residue linker, to generate the ftision peptide designated NZGFP. Similarly, CZ was 
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appended to tiie N-terminal residue of CGFP, via a four residue linker, to generate flie 
complementary fusion peptide, CZGFP. 

It was envisioned that if NZGFP and CZGFP were competent to heterodimerize 
via the designed helices, either in vitro or in vivo, the reconstituted GFP protem would 

5 display its characteristic fluorescence, indicating the correct reassembly of the tertiary 
fold from the peptide fragments. The genes encoding the designed protein sequences 
NZGFP, CZGFP, NGFP, and CGFP were cloned and the resulting proteins overexpressed 
and purified using methods routinely practiced by the skilled artisan. 

To investigate the viabiUty of the protein reassembly strategy, a literature protocol 

10 devised for the refolding of denatured GFP was followed (Reid, B. G. et al, 1997, 
Biochemistry, 36, 6786-6791). Thus, equimolar amounts (4 jiM) of the fragments, 
NZGFP and CZGFP, were denatured in 6 M GdmHCl and dialyzed into a buffer 
containing 2 mM DTT, 10 mM phosphate buffer at pH 7.2 over 24 hrs at 4 **C. The 
reassembled peptides were visibly green. Moreover die Xmax for the fluorescence 

15 excitation and emission spectra were identical to that of the parent GFP (Figure 2 inset). 
To verify that the reassembly was indeed guided by the antiparallel leucine zippers, 
control experiments were done with fragments with £ind without die leucine zippers. It 
was found that solutions containing NGFP, CGFP, NGFP/CGFP, NZGFP/CGFP, or 
NGFP/CZGFP did not fluoresce, even at concentrations of over 100 }iM. The apparent 

20 dissociation constant, K<iapp, for the NZGFP/CZGFP complex was determined by titrating 
NZGFP into a solution of CZGFP and monitoring the fliiorescence emission intensity at 
505 nm (Figure 2). The data were fitted to a two-stiate bindiiig isotherm, yielding a Kdapp 
of 31 ± 7 nM and a-analysis of the binding data verified the expected 1:1 stoichiometry 
of NZGFP and CZGFP (Bagshaw, C. R.; et al, 1987, Spectrophotometry and 

25 spectrqfluorimetry: A practical approach, pp 91-113). 

Example 2: In Vivo Reassembly of the Dissected GFP Fragments 
BL21(DE3) JSl coli cells were transformed with equimolar amounts of NZGFP 
and CZCTP encoding plasmids. The ^pearance of green color was monitored to identify 
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cotransfonned colonies expressing reassembled GFP . After 36 hours several of the 
colonies turned green as illustrated in Figure 3a. with a cotransformation efficiency of 
4%. Individual colonies were cultured in liquid media and their protein expression 
pattern analyzed. The green colonies were shown to express similar amounts of NZGFP 
and CZGFP (Figure 3b and 3 c), whereas non-fluorescent colonies were shown to contain 
either NZGFP or CZGFP. Furthermore, control cotransformation experiments with 
NGFP/CGFP, NGFP/CZGFP and NZGFP/CGFP failed to show any green colonies, thus 
emphasizing the requirement for the presence of both NZ and CZ leucine zippers to 
mediate GFP assembly in vivo and in vitro. 



Example 3: Reassembly of Proteins Using Antiparallel Leucine Zipper 

The methods described above for reassembly of GFP in vivo and in vitro may be 
modified for reassembly of any protein of interest, using antiparallel leucine zippers. The 
particular proteins are not critical, so long as they cian be divided into fi-agments fliat 
1 5 produce a detectable signal upon their association, specific bmding, or complexation 
mediated by the formation of an antiparallel zipper with a kndwn biological activity or 
function that can be assayed for in vitro or in vivo, for example^ kinase activity for a 
protein kinase, proteolytic activity for a protease, and DNA binding activity of DNA 
binding protein. 

20 The peptide firagments of the protein of interest are fused to each of the helices 

(SEQ ID NO: 1 and SEQ ID NO: 2) described above. Alternatively, oflier pairs of helices 
that form antiparallel leucine zippers may be designed ahd'fused to the peptide fiagments 
of the protein of interest. 

25 h In KjEVrP Reassembly 

Equimolar amoimts of the fusion peptides comprising peptides of the protein of 
interest and helices that form antiparallel leucine zippeirs are denatured and dialyzed as 
described in Example 1 . The reconstitution of the protein is monitored. 
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2, In Vivo Reassembly 

Equimolar amounts of plasmids encoding the fusion peptides are transformed in 
host eucaryotic or procaryotic host cells as described in Example 2. The cotransformed 
colonies expressing reassembled protein are identified. 

5 

Example 4: The GRIP Assay 

In order to test the specificity of the GRIP assay, a set of four possible LZ 
combinations which were either electrostatically matched (EE-NGFP/KK-CGFP and 
NZGFP/CZGFP) or mismatched (EE-NGFP/CZGFP and NZGFP/KK-CGFP) (Fig, 4; 

10 Bryson, et aU 1995, Science 270, 93^5-941, Oakley et al, 1998, Biochemistry 37, 12603- 
12610) were designed. The GRIP assay would only allow the matched pairs to 
competently fold and catalyze fluorophore formation in GFP. 

Escherichia coli (BL21) cells were cotransformed with plasmids encoding the 
proteins of interest and plated on ampicillin containing plates. Fluorescent colonies were 

1 5 observed only in the complementary pairs (EE-NGFP/KK-CGFP and NZGFP/CZGFP). 
No visible fluorescence was observed in colonies containing the uncomplementary pairs 
(EE-NZGFP/CZGFP and NZGFP/KK-CGFP). Since the electrostatically mismatched 
pairs have a dissociation constant, Kd, of - 100 |iM (Y ao, S., et al.^ 1998, Nature 396, 
447-450), this experiment sets an initial lower visual limit for detecting protein-protein 

20 interactions using the GRIP assay. 

Example 5: GRIP Assay and Combinaitorial Selection 

A library of LZ partners that could either code for Glu or Lys with equal 
probability at the e and g "specificity" positions (Fig. 5A) was generated (Oakley, M. G. 
25 et al, 1998, Biochemistry 37, 12603-12610; Pelletier, J.N. et al, 1999, Nat Biotechnol. 
17, 683-690; Dmitry, K., et al, 1994, EMBO X 13, 2849-2861). This library was fused to 
the C-terminal GFP fragment (XX-CGFP). It is thought that the selected partners would 
be enriched in Lys in order to complement the acidic EE-NGFP. 
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The 256-memberplasmid-encoded library of XX-CGFP was cotransformed with 
EE-NGFP and selected colonies that exhibited fluorescence. The protein expression 
profiles of the two protein fragments, XX-CGP and EE-NGFP, were virtually identical in 
cotransformed cells (Fig. 5B), thus excluding differences in relative protein concentration 
5 as a major determinant of the observed fluorescence. Sixteen of the multiple colonies 
exhibiting fluorescence were sequenced. The results of the selection are summarized in 
Figure 5C. The selected L2 partners of EE-NGFP displayed an overall 3:1 ratio of 
Lys:Glu residues, with the fewest Lys residues being 5 and the most being 7. Thus, an 
overall enrichment of Lys residues was observed as predicted from the requirement for 

10 electrostatic pairing of Lys/Glu for stabiUzing the leucine zipper. Assuming an average 
value of 0.85 kcal/mol penalty for each Glu-Glu pair relative to Lys-Glu pair based on 
Uterature precedence, (Dmitry, K., et al., 1994, EMBOJ. 13, 2849-2861; Zhou, N. E. et 
al., 1994. Protein Eng. 7, 1365-1372), a 75 fold difference in Kj between the best with aU 
Lys (Kd = 33 nM) (Ghosh, I. et al., 2000, J. Am. Chem. Sod 122. 5658-5659) and worst 

15 (5 Lys and 3 Glu) leucine zipper partners for EE-lSTGFP Vas estimated. Thus, this 

experiment sets a lower threshold for the visual detection of interacting proteins in the 
GRIP assay to ~ 2.5 jiM, which is within the observed dissociation constant for most 
specific protein partners. 

20 Example 6: Inhibition of Protein-Protein Interactions 

To verify that the GRIP assay could be titiUzed for detecting inhibitors of protein- 
protein interactions (Zutshi, R., et al., 1998, Curr. Opin. Chem. Biol. 2, 62-66), a LZ 
peptide corresponding to the LZ present in NZGFP (Fig. 4) was synthesized. The 
reappearance of fluorescence of the disassembled NZGFP/CZGFP complex in an in vitj-o 

25 assay as a function of added peptide was monitored (Fig. 6). The EK peptide (SEQ ID 
NO: 1) prevented the assembly of the complex (4 nM) with an IC50 value of 31 nM. 
Control experiments with addition of either NGFP or CGFP fragments that lacked leucine 
zippers did not prevent reassembly of NZGFP/CZGFP complex (Fig. 6). It is worth 
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noting that disassembly of an existing GFP complex was not achievable even at > 1 mM 
added peptide inhibitor. 

Example 7: GRIP Assay and Identification of Binding Partners Via 
5 Combinatorial Selection 

The GRIP assay may be modified by substituting the helices that form antiparallel 
leucine zippers with test proteins or peptides to determine whether a test protein or 
peptide attached to one portion of GFP interacts witii another test protein or peptide 
attached to the other portion of GFP. The test proteins can be any protein. As an 
10 example, an orphan receptor can be fused to one portion of the GFP, while test ligands 
can be fused to the second portion of GFP . 

Specifically, nucleic acid encoding a fusion protein comprising an orphan receptor 
and a first portion of the GFP and a plasmid library effusion proteins comprising test 
Ugands and the second portion of GFP can be cotransfected or cotransformed into host 
15 cells. Colonies exhibiting fluorescence are selected, since tiiey contain GFP molecules 
that have been properly folded or reassembled, and test ligands that interact with the 
orphan receptor. The colonies can be further cultured and investigated to determine the 
structural properties of the hgand. The molecular weight of the ligand may be 
determined by SDS-PAGE, and the primary stracture may be determined by amino acid 
20 sequencing; *. ' . ' 

Examples of orphan receptor groups include but are not limited to CCRL2, 
CMKLRl, CMKRL2, GPR31, HM74, and RDCl. Specific exanq)les of orphan receptors 
of each group include but are not limited to: 1) CCRL2: chemokine (C-C motif) 
receptor-like 2, HCR, CRAM-B, CKRX, CRAM-A, lipopolysaccharide inducible C-C 
25 chemokine receptor related, EOl ; 2) CMKLRl : chemokine-like receptor 1, ChemR2S, 
CMKRL3, DEZ, CMKLRl, LOC60669: G-proteiii coupled chemoattractant-like 
receptor; 3) CMKRL2: chemokine receptor-like 2, CMKRL2, FEG-1, GPCR-BR, 
DRY12, CEPR, GPR30, GPR41; 4) GPR31: G protein-coupled receptor 31, GPR31, 
Gpr3 lb; 5) HM74: putative chemokine receptor, GTP-binding protein; and 6) RDCl : 
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chemokine orphan receptor, D2S87E, GPRNl, CMKORl, canine oiphan receptor RDCl 
homolog, chemokine orphan receptor 1, Rdcl. 

The GRIP assay may be modified to detect macromolecular interactions, for 
example, specific protein-protein interactions, both in vitro and in vivo. When the 
5 proteins attached to the two OFF fragments associate with each other, the two OFF 
fragments will properly reassemble and fluoresce. Thus, in the absence of association, 
the proteins attached to the OFF fragments do not fluoresce. Fluorescence of interacting 
protein pairs linked to NGFF and CGFF can provide a sensitive assay for detecting the 
affinity and specificity of the individual protein pairs (and their mutants) under 
10 investigation. 

Examples of protein-protein interactions include, but are not limited to, 
antigen/antibody, ligand/receptor, antagonist or inhibitor^rotein, binding protein/protein, 
and enzyme/substrate. Specific protein-rprotein interactions involved in disease and 
identified as potential dmg targets include examples such as Bax/Bcl-2 (Sartorius, et al, 
15 2001, Chembiochem. 2 (1), 20), p53/mdm2 (Moll et al. 2000, Drug Resist Update, 3 (4), 
217)), VEGF/VEGF-R (Plate et al, 1992, Nature, 359, 845), IL.6/IL.6R (Akira et al, 
1993, Adv. Immunol , 54, 1), Ras/Raf (Weinstein-Oppenheimer et al, 2000, Pharmacol 
7%er., 88(3), 229). 

Examples of other macromolecular interactions include, but are not limited to, 
20 nucleic acid-nucleic acid binding protein interactions and carbohydrate-protein 
interactions. 

Example 8: GRIP Assay and Identification of Inhibitors Via Combinatorial 
Selection 

25 The GRIP assay may be modified to identify inhibitors of a specific protem- 

protem interaction. For example, a receptor can be fiised to a portion of GFP, while a 
ligand can be fiised to a second portion of GFP. A test inhibitor, such as a test antagonist, 
can be incubated with the two GFP fiision proteins comprising the ligand and receptor to 
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see if it prevents the reassOTibly of GFP which can be detected by the loss of 
fluorescence. 

Specifically, nucleic acid encoding a fusion protein comprising a known receptor 
and a first portion of the GFP, nucleic acid encoding a fusion protein comprising its 

5 known ligand and the second portion of GFP, and a plasmid library of test antagonists can 
be cotransfected or cotransforaied into host cells. Colonies fliat do not exhibit 
fluorescence are selected, since they contain GFP molecules that have been prevented 
from folding or reassembly and test antagonists that inhi]bit the interaction of the known 
receptor with its ligand. The colonies can be further cultured and investigated to 

1 0 detemiine the structural properties of the Ugand. The molecular weight of the Ugand may 
be detemiined by SDS*PAGE, and the primary structure may be determined by amino 
acid sequencing. 

It should be understood that the foregoing discussion and examples merely present 
15 a detailed description of certain preferred embodiments. It therefore should be apparent 
to those of ordinary skill in the art that various modifications and equivalents can be made 
without departing firom the spirit and scope of the invention. All journal articles, other 
references, patents, and patent applications that are id^tified in this patent s^lication are 
incorporated by reference in their entirety. 
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Claims 



1 . A protein complex comprising a first and second peptide, each of said 
peptides being joined to a heterologous helical domain, said heUcal domains being 

5 noncovalently associated to form an antiparallel leucine zipper. 

2. The protein complex of claim 1 , wherein said peptides form a signaling 
moiety while complexed. 



10 

joined to said helical domains via a linker. 



3. The protem complex of claim 1, wherein said first and second peptides are 



4. The protein complex of claim 1, wherein each of the first and second 
pq)tides comprises a distinct portion of green fluorescent protein (GFP). 

15 

5. The protein complex of claim 1, wherein each of tiie helical domains 
coiiq)rises an amino add sequence as set forfli in SEQ ID NO: 1 or SEQ ID NO: 2. 

6. A nucleic acid encoding a fusion protein comprising a peptide and a 
20 helical domain, said heKcal domain forming an antiparallel leucine zipper when it 

noncovalently associates with a complementary hdlical domain. 

7. The nucleic acid of claim 6, wherein the fusion protein, further comprises a 
linker moiety interposed between the peptide and the helical domain. 

25 

8. The nucleic acid of claim 6, wherein the peptide comprises a peptide of 
green fluorescent protein (GFP). 
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9. A fusion protein comprising a peptide and a helical domain, said helical 
domain forming an antiparallel leucine zipper when it noncovalently associates with a 
complementary helical domain. 

10. The fusion protein of claim 9, wherein the fusion protein further comprises 
a Unker moiety interposed between the peptide and the helical domain. 

1 1. The fusion protein of claim 9, wherein the peptide comprises a peptide of 
green fluorescent protein (GFP). 

12. A method of assembling a proteiii complex comprising the steps of: 

(a) providing first and second helical domains that non-covalently associate to 
form an antiparallel leucine zipper; 

(b) providing first and second peptides; 

(c) producing fusion proteins by separately fusing said first helical domain to said 
first peptide and said second helical domain to said second peptide; and, 

(d) allowing the fusion proteins to form a protein complex iiiediated by the non- 
covalent association of the first and second helical domains into an antiparallel leucine 
zipper. 

13. The method of claim 12, wherein the protein complex comprises a 
signaling moiety. 

14. The method of claim 12, wherein each of the helical domains comprises a 
25 leucine rich hydrophobic core. 

15. The method of claim 14, wherein each of the heUcal domains further 
comprises acidic residues and basic residues. 
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1 6. The method of claim 1 5, wherein each of the helical domains further 
comprises a buried asparagine residue. 

1 7. The method of claim 12, wherein the pair of helical domains has the amino 
5 acid sequences as set forth in SEQ ID NO: 1 and SEQ ID NO: 2. 

18. The method of claim 12, wherein the step of producing the fusion proteins 
further comprises interposing a linker moiety between the peptide and the helical domain. 

0 19. The method of claim 12, wherein the distinct peptides are derived from 

GFP. 



20. A method of identifying a polypeptide that interacts with a known 
polypeptide comprising the steps of, 

^ 5 (^) producing a first fusion protein comprising the known polypeptide linked 

to a first GFP fragment; 

(b) producing a second fusion protein comprising a test polypeptide linked to 
a second GFP firagment, wherein association of the first and second GFP fragments 
results in a GFP that exhibits detectable fluorescence; 
20 (c) allowing the first fusion protein to associate with the second fusion protein 

to form a complex mediated by the non-covalent Association of the known polypeptide 
and test polypeptide; and, 

(d) detecting association of GFP, whereiii association of GFP indicates that 
the test polypeptide interacts with the known polypeptide. 

25 

21. The method of claim 20, wherein the fi^rst GFP peptide is NGFP and the 
second GFP peptide is CGFP. 
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22. A method of identifying a polypeptide that interacts with a known 
polypeptide comprising the steps of, 

(a) producing nucleic acid encoding a fusion protein comprising the known 

polypeptide linked to a jSrst GFP fragment; 
5 (b) producing nucleic acids encoding fusion proteins comprising a test 

polypeptide linked to a second GFP fragment, wherein association of the first and 

second GFP fragments results in a GFP that exhibits detectable fluorescence; 

(c) cotransforming or cotransfecting the nucleic acids of steps (a) and (b) into 

a host cell for expression of the mcoded fusion proteins; 
1 0 (d) selecting colonies that exhibit fluorescence; and, 

(e) cultuiing the selected colonies to identify the test polypeptides that interact 
with the known polypeptide. 

23. The method of claim 22, wherein the first GFP peptide is NGFP and the 
15 second GFP peptide is CGFP. 

24. The method of claim 22, wherein the nucleic acids of step (b) are produced 
in the form of a combinatorial library. 

20 25. A method of identifying a molecule that inhibits the activity of a known 

protein comprising, 

(a) producing a first fusion protein comprising a first known polypeptide 
linked to a first GFP fragment; 

(b) producing a second fusion protein comprising a second polypeptide linked 
25 to a second GFP fragment, wherein the second polypeptide is known to interact 

with the first polypeptide and wherein association of the first and second GFP 
fragments results in a GFP that exhibits detectable fluorescence; 
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(c) allowing the first fusion protein to associate with the second fusion protein 
to form a GFP complex mediated by the non-covalent association of the first and 
second polypeptide; 

(d) incubating a test molecule with the GFP complex; and, 

(e) detecting disassembly of the complex, wherein disassembly of the 
complex indicates that the test molecule inhibits the activity of the known protein. 

26. The method of claim 25, wherein the first GFP peptide is NGFP and the 
second GFP peptide is CGFP. 

27. A method of detecting protein-protein interaction comprising, 

(a) producing a first fusion protein comprising a known polypeptide linked to 
a first GFP firagment; 

(b) producing a second fusion protein conoiprising a test polypeptide linked to 
a second GFP firagment, wherein association of the- first and second GFP fiagments 
results in a GFP that exhibits detectable fluorescence; 

(c) allowing the first fusion protein to associate with the second fusion protein 
to form a complex mediated by the non-covalent association of the known polypeptide 
and test polypeptide; and, 

(d) detecting reassembly of GFP, wherein reassembly of GFP indicates that 
the test polypeptide interacts with ibe known polypeptide. 

28. The method of claim 27, whereui ttie method further comprises obtaining 
nucleic acids encoding the first and second fusion protein and cotransfecting or 

25 cotransfomiing the nucleic acids into a cell to obtain the first and second fusion protein. 
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